170 research outputs found

    Spreading good news.

    Get PDF

    A statistical test for the difference in the amounts of DNA variation between two populations

    Get PDF
    A statistical test for the difference in the amounts of DNA variation between two populations is developed. The test statistic involves the covariance of the amount of variation between two populations, which is given by a function of their divergence time, T0. Accordingly, the power (rejection probability) of the test depends on T0. In this article, T0 is treated as unknown because it is very difficult to estimate. The test is most conservative when T0 = ∞ is assumed because the covariance is zero. If T0 = 0 is assumed, the largest value of the rejection probability is obtained. Thus, the test provides a range of rejection probability unless we have a reliable estimate of T0. The test is applied to the PgiC region in three mustard species: Leavenworthia stylosa, L. crassa and L. uniflora. The results of our test show that the level of variation in L. stylosa is significantly higher than those in the other species

    The Power of QTL Mapping with RILs

    Get PDF
    QTL (quantitative trait loci) mapping is commonly used to identify genetic regions responsible to important phenotype variation. A common strategy of QTL mapping is to use recombinant inbred lines (RILs), which are usually established by several generations of inbreeding of an F1 population (usually up to F6 or F7 populations). As this inbreeding process involves a large amount of labor, we are particularly interested in the effect of the number of inbreeding generations on the power of QTL mapping; a part of the labor could be saved if a smaller number of inbreeding provides sufficient power. By using simulations, we investigated the performance of QTL mapping with recombinant inbred lines (RILs). As expected, we found that the power of F4 population could be almost comparable to that of F6 and F7 populations. A potential problem in using F4 population is that a large proportion of RILs are heterozygotes. We here introduced a new method to partly relax this problem. The performance of this method was verified by simulations with a wide range of parameters including the size of the segregation population, recombination rate, genome size and the density of markers. We found our method works better than the commonly used standard method especially when there are a number of heterozygous markers. Our results imply that in most cases, QTL mapping does not necessarily require RILs at F6 or F7 generations; rather, F4 (or even F3) populations would be almost as useful as F6 or F7 populations. Because the cost to establish a number of RILs for many generations is enormous, this finding will cause a reduction in the cost of QTL mapping, thereby accelerating gene mapping in many species

    The Pattern of Polymorphism on Human Chromosome 21

    Get PDF
    Polymorphism data from 20 partially resequenced copies of human chromosome 21—more than 20,000 polymorphic sites—were analyzed. The allele-frequency distribution shows no deviation from the simplest population genetic model with a constant population size (although we show that our analysis has no power to detect population growth). The average rate of recombination per site is estimated to be roughly one-half of the rate of mutation per site, again in agreement with simple model predictions. However, sliding-window analyses of the amount of polymorphism and the extent of linkage disequilibrium (LD) show significant deviations from standard models. This could be due to the history of selection or demographic change, but it is impossible to draw strong conclusions without much better knowledge of variation in the relationship between genetic and physical distance along the chromosome

    ゲノムの脆弱部位を利用した適応進化

    Get PDF
    2015~2017研究成果の概要(和文):本研究では、バクテリアから高等生物まで、複数のゲノムを調べることによって、ゲノムの進化速度は一様ではなく、脆弱な速い進化を示す部分と、そうでない部分が存在することを示した。特に、寄生植物とウマを対象にした研究が成果をあげた。寄生植物は(特に完全寄生植物)、宿主ゲノムから柔軟に遺伝子を取り込み、自分のゲノムを進化させていることがわかった。その取り込んだ遺伝子は、ゲノム中に一様に分布しているのではなく、偏りが見られた。おそらく、脆弱性の高い部位に多く取り込まれたのであろう。ウマゲノムにおいても、家畜化において激しく変化したゲノム領域と、そうでない領域を特定した。研究成果の学術的意義や社会的意義脆弱部位は突然変異のホットスポットであるため、基本的には有害な物である。したがって、進化的に一時的に存在するだけで、すぐに淘汰され消滅するものであると考えられている。しかし、この概念は必ずしも正しくない。本研究では、バクテリアから高等生物まで、複数のゲノムを調べることによって、ゲノムの進化速度は一様ではなく、脆弱な速い進化を示す部分と、そうでない部分が存在するこを示した。研究成果の概要(英文):We aim to demonstrate the evolutionary importance of fragile sites. Fragilesites are hot spots of mutation, which are usually selected against. In this sense, they areevolutionary useless. Here, we hypothesized that a genome has some benefit if it has fragile sites,especially when the host species face environmental changes and has to adapt in a short time. Todemonstrate this, we investigated genomes of various species. For example, we found a number ofgenes have been integrated in the genome of parasite plants, and there seem to be hotspots ofintegration, where multiple genes are integrated (Kado Inann 2018 Genome Biol. Evol.). We alsoinvestigated the genome of domesticated horse, from which we found a number of regions that havechanged dramatically in the domestication process

    The rate and tract length of gene conversion between duplicated genes

    Get PDF
    Interlocus gene conversion occurs such that a certain length of DNA fragment is non-reciprocally transferred (copied and pasted) between paralogous regions. To understand the rate and tract length of gene conversion, there are two major approaches. One is based on mutation-accumulation experiments, and the other uses natural DNA sequence variation. In this review, we overview the two major approaches and discuss their advantages and disadvantages. In addition, to demonstrate the importance of statistical analysis of empirical and evolutionary data for estimating tract length, we apply a maximum likelihood method to several data sets

    Confounding factors in HGT detection: Statistical error, coalescent effects, and multiple solutions

    Get PDF
    Prokaryotic organisms share genetic material across species boundaries by means of a process known as horizontal gene transfer (HGT). This process has great significance for understanding prokaryotic genome diversification and unraveling their complexities. Phylogeny-based detection of HGT is one of the most commonly used methods for this task, and is based on the fundamental fact that HGT may cause gene trees to disagree with one another, as well as with the species phylogeny. Using these methods, we can compare gene and species trees, and infer a set of HGT events to reconcile the differences among these trees. In this paper, we address three factors that confound the detection of the true HGT events, including the donors and recipients of horizontally transferred genes. First, we study experimentally the effects of error in the estimated gene trees (statistical error) on the accuracy of inferred HGT events. Our results indicate that statistical error leads to overestimation of the number of HGT events, and that HGT detection methods should be designed with unresolved gene trees in mind. Second, we demonstrate, both theoretically and empirically, that based on topological comparison alone, the number of HGT scenarios that reconcile a pair of species/gene trees may be exponential. This number may be reduced when branch lengths in both trees are estimated correctly. This set of results implies that in the absence of additional biological information, and/or a biological model of how HGT occurs, multiple HGT scenarios must be sought, and efficient strategies for how to enumerate such solutions must be developed. Third, we address the issue of lineage sorting, how it confounds HGT detection, and how to incorporate it with HGT into a single stochastic framework that distinguishes between the two events by extending population genetics theories. This result is very important, particularly when analyzing closely related organisms, where coalescent effects may not be ignored when reconciling gene trees. In addition to these three confounding factors, we consider the problem of enumerating all valid coalescent scenarios that constitute plausible species/gene tree reconciliations, and develop a polynomial-time dynamic programming algorithm for solving it. This result bears great significance on reducing the search space for heuristics that seek reconciliation scenarios. Finally, we show, empirically, that the locality of incongruence between a pair of trees has an impact on the numbers of HGT and coalescent reconciliation scenarios

    QTL Map Meets Population Genomics: An Application to Rice

    Get PDF
    Genes involved in the transition from wild to cultivated crop species should be of great agronomic importance. Population genomic approaches utilizing genome resequencing data have been recently applied for this purpose, although it only reports a large list of candidate genes with no biological information. Here, by resequencing more than 30 genomes altogether of wild rice Oryza rufipogon and cultivated rice O. sativa, we identified a number of regions with clear footprints of selection during the domestication process. We then focused on identifying candidate domestication genes in these regions by utilizing the wealth of QTL information in rice. We were able to identify a number of interesting candidates such as transcription factors that should control key domestication traits such as shattering, awn length, and seed dormancy. Other candidates include those that might have been related to the improvement of grain quality and those that might have been involved in the local adaptation to dry conditions and colder environments. Our study shows that population genomic approaches and QTL mapping information can be used together to identify genes that might be of agronomic importance

    mbs: modifying Hudson's ms software to generate samples of DNA sequences with a biallelic site under selection

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The pattern of single nucleotide polymorphisms, or SNPs, contains a tremendous amount of information with respect to the mechanisms of the micro-evolutionary process of a species. The inference of the roles of these mechanisms, including natural selection, relies heavily on computer simulations. A coalescent simulation is extremely powerful in generating a large number of samples of DNA sequences from a population (species) when all mutations are neutral, and Hudson's <b>ms </b>software is frequently used for this purpose.</p> <p>However, it has been difficult to incorporate natural selection into the coalescent framework.</p> <p>Results</p> <p>We herein present a software application to generate samples of DNA sequences when there is a biallelic site targeted by selection. This software application, referred to as <b>mbs</b>, is developed by modifying Hudson's <b>ms</b>. The <b>mbs </b>software is so flexible that it can incorporate any arbitrary histories of population size changes and any mode of selection as long as selection is operating on a biallelic site.</p> <p>Conclusion</p> <p><b>mbs </b>provides opportunities to investigate the effect of any mode of selection on the pattern of SNPs under various demography.</p
    corecore